Cerebras-GPT 13B is a large language model trained based on an open architecture and dataset. It belongs to the Cerebras-GPT series and aims to study the scaling laws of large language models and demonstrate the simplicity and scalability of training on the Cerebras software and hardware stack.
Large Language Model
Transformers English